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ABSTRACT 

The gravitational wave signal arising from the collapsing iron core of a Type 
II supernova progenitor star carries with it the imprint of the progenitor's mass, 
rotation rate, degree of differential rotation, and the bounce depth. Here, we 
show how to infer the gravitational radiation waveform of a core collapse event 
from noisy observations in a network of two or more LIGO-like gravitational wave 
detectors and, from the recovered signal, constrain these source properties. Using 
these techniques, predictions from recent core collapse modeling efforts, and the 
LIGO performance during its S4 science run, we also show that gravitational wave 
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observations by LIGO might have been sufficient to provide reasonable estimates 
of the progenitor mass, angular momentum and differential angular momentum, 
and depth of the core at bounce, for a rotating core collapse event at a distance 
of a few kpc. 

Subject headings: gravitational waves — methods: data analysis — supernovae: 
general 



1. INTRODUCTION 
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pact part of the source mass distribution. From the observed waves we have the potential 
to infer the factors that govern that evolution. For example, the evolution of the collapsing 
stellar core in a type II supernova is determined in part by the progenitor mass density and 
angular momentum distributions in the inner core. None of these properties of the progenitor 
can be directly determined from the electromagnetic radiation we observe when the shock 
emerges from the stellar envelope. In addition, the forces that govern the evolution of the 
core depend upon many things that are unknown, or poorly modeled, today (for example, 
the matter equation of state, the role played by neutrinos and neutrino radiation transport, 
general relativity, convection and non-axisymmetry) . The gravitational waves emitted dur- 
ing the collapse phase and its aftermath carry the signature of all these preconditions and 
the dynamics that govern both the collapse and subsequent rebound. Reading that signa- 
ture requires inferring the waveform from the noisy detector observations. Here, we develop 
a new method for inferring the gravitational radiation waveform from th e noisy data fr om 
two or more detectors, based on Jaynes' principle of maximum entropy ( Jaynes 1957bl Jah . 
demonstrate its effectiveness when applied to the discovery of signals arising from simulated 
rotating iron core collapse buried in simulated detector noise, and show how the inferred 
waveform can be used, in principle, to gain insight into the properties of the source. 

The first problem addressed in this study is the problem of inferring the gravitational 
wave signal from the data produced by the detectors, also referred to as the deconvolution 
or inverse problem. Inverse proble ms generally have bee n long-recognized as a problem to 
be approached with great care (see lEvans fc Stark! |2002| for a recent review). The detector 
response, which relates the incident gravitational wave to the signal observed in the detector, 
is generally an ill-conditioned function; additionally, the presence of additive noise generally 
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confuses the observation. Together, the noise and the ill-conditioned detector response gen- 
erally thwart naive attempts at signal deconvolution. Early attempts at gravitational wave 
signal decon yomtion explored a lea st-squares or maximum likelihood approach to deconvo- 
lution: e.g., iGiirsel &: Tintol (119891 ) developed a procedure for inferring a plane wave signal 
incident on a network of detectors with a frequency-independent response, in which case the 
problem response is well-conditioned. These techniques tend to over-fit the observations; 
additionally, realistic detectors have frequency responses that are ill-conditioned and this 
complicates a least-squares or maximum likelihood approach to deconvolution. More recent 
efforts hav e explored regulari zed methods for inferring the incident gravitational radiation 
waveform (jRakhmanovl 120061 ) . Here, we use the maximum entropy principle to regularize 
the deconvolution problem, developing an application to the gravitational wave inference 
problem that is applicable in the general case of a frequency dependent and ill-conditioned 
detector response and avoids over-fitting the observations in the presence of additive detec- 
tor noise. Going beyond the problem of inferring the incident wave, we use the inferred 
waveform to explore how, and how well gravitational wave observations of this kind can be 
used to learn about the source. 
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itage to the simpler problem of reconstructing the time-dependent plane wave signal incident 
on a network of gravitational wave detectors from their time series response. 

Once we have inferred the incident signal, we are faced with a second problem of in- 
ferrence: identifying the properties of the source from the signal. One way of associating 
gravitational waveforms with supernova properties is to compare the inferred waveform with 
models arising from simulations that explore the signal dependency over a broad range of 
physical parameters. We should expect the inferred waveform to have the most in com- 
mon with the simulated waveforms arising from models whose character is most similar to 



^dn all these applications the Principle of Maximum Entropy actually plays a relatively small role: much 
more important is Bayes Law and the understanding that probability can represent degree of belief or 
certainty in the state of a system; nevertheless, the term Maximum Entropy has come to refer to all these 
methods of deconvolution and we continue that tradition. 



-4- 



the actual source. Here, we compare, using the cross- correlation, inferred waveforms with 
simulated waveforms arising from different rotating core collapse models. We assume that 
models whose signal shows the greatest correlation are the ones most likely to be similar to 
the source|§ 

Section [2] describes in detail the development of the inference Bayesian inference method 
that we have developed to identify the time-dependent gravitational wave signal incident on 
a network of detectors, ending with a demonstration of the method applied to simulated 
observations made at an idealized two-detector network. Section [3] reviews the expected 
gravitational- wave emission processes in core-collapse sup ernovae and desc ribes the recently 



produced catalog of rotating core-collapse waveforms by lOtt et al.l (120041 ) and the physics 
that went into the considered models. We use these simulated core-collapse signals to char- 
acterize how well our procedure for inferring the incident signal and characterizing the source 
works. Finally, in § H] we summarize our conclusions and directions for further study. 

One important goal with this paper is to connect the two communities of gravitational 
wave experimentalists and supernova modelers in a way that has not been done in the past. 
For this reason we have taken particular care to make the discussion of § [2] pedagogical in 
nature. Future gravitational wave models can be put through the pipeline established with 
this paper so that we can obtain more credible estimates of what might be possible with 
either initial or advanced LIGO, Virgo, GEO600, or TAMA300, or with any combination of 
these, for any theoretical model. This has simply never been done for any collapse models, 
and is a novelty of this paper that we hope will stimulate further interactions between 
astrophysicists doing supernova simulations and the gravitational wave detection and data- 
analysis communities. 



2. A Bayesian Approach to Deconvolution 
2.1. Introduction 

The first problem faced in the analysis and interpretation of astronomical data is the 
identification of a signal in noisy observations. When the signal being sought takes a known 



Note the distinction between correlation and the more involved matched filtering; (jFinnl Il992t 



Finn fc Chernoflll993h . Matched filtering is a useful analysis technique when the functional form of the 



signal being sought is known precisely. Here, we presume that we have only a qualitative model of the signal 
dependence on the physical parameters of interest. In that case, using the full apparatus of matched filtering 
could very well lead us to reject real signals because the match to the model is only qualitative. 
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form then special techniques, tuned to the characteristics of the signal, may be used; however, 
the general problem remains the same: the first requirement is to identify the characteristics 
of the radiation h incident on the detector(s) that gives rise to the observational data d. 
The data may be from a radio antenna, a spectrograph, or a network of gravitational wave 
detectors; our goal may be to map of the radio emission from a distant AGN, the emission 
spectra of an accretion disk, or the burst of gravitational waves associated with a core- 
collapse supernova. In the case of gravitational wave data, which is our principle interest 
in this paper, the data d are a set of time series corresponding to the sampled output of 
each detector, one time series from each detector in our network; the radiation h is the time 
dependent gravitational wave strain incident as a plane wave on the detector (s), which may 
be characterized (for example) by the direction of wave propagation and the time dependent 
strain amplitudes h + (t) and h x (t) in the two polarization states as measured at the Earth's 
barycenter. Restricting attention to linear detectors (which is almost always the case and 
thus hardly a restriction at all) the observations in each case are a linear superposition of 
a confounding noise and the incident radiation as filtered by the detector response function 



where Rh is the convolution of the detector network response with the incident radiation 
and n is the noise contribution to the observation d. We generally know R and the statistical 
properties TV of the noise n; with these our aim is to determine h from d. 

In the absence of noise the solution to this problem appears, at first blush, to be straight- 
forward: i.e., we solve for h the system of equations [T] with n = 0. Problems arise when 
R is not invertible, either because we have too much or two little data (i.e., the problem is 
over- or under-determined), or because the detector (s) is (are) insensitive to some aspects 
of the signal h (e.g., the detector cannot distinguish between polarization states of h, or is 
insensitive to signal energy outside of some band). In these cases we are forced to deal with 
uncertainty: if the data d over-determine h we must be prepared to resolve contradictions 
within the data; if d under-determine h any conclusions we reach must be tempered by our 
incomplete knowledge of h. 

These problems are only compounded when we consider the real case of noisy data. 
In real problems the response function R is always ill-conditioned: i.e., the response is not 
invertible (the problem is over-determined), or not unique (the problem is under-determined), 
or the solution h to an equation of the form Rh = d is sensitive to small perturbations in 
d. Sensitivity of h to perturbations in d presents a new twist: even when when the noise 
contribution to the observation d is numerically small, it is dangerous to assume that R _1 d is 
close to h. Finally, even in those cases where the response is invertible and well-conditioned, 
applying the inverse to d does not distinguish between the signal h and a "signal equivalent 



R: 



d = Rh + n 
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noise" R n. 

Framing the problem of deconvolution in the language of Bayesian inference provides 
guidance on how to proceed in determining h. In a Bayesian approach to the problem of 
inference we seek not h, but a probability distribution 

(probability density that h' is incident on the detector(s) \ 
given data d, response R, noise characterization . (2) 

A/", and other unenumerated assumptions X. / 

The probability density /, if we can find it, fully describes our legitimate knowledge - 
including uncertainty — regarding the waveform and its properties: e.g., we can compute 
from it our expectation of the gravitational wave power incident on the detector, etc. It 
also provides a good point estimate of the incident radiation: i.e., the h' that maximizes 
/(h'|d, R, N,T). We can report / or, as is more generally the case, report some summary 
of the distribution /: e.g., the h' that maximizes / and some suitably defined "error bars", 
which summarize the degree of our uncertainty. 

Following Bayes Law, the probability distribution / may be "factored" into the product 
of three other distributions: 

ffh'ld RAfD- 4(d|h',R,A/-,J)g(h'|J) 

/(h |d,R,A/, J) v(d | R> ^ >2) > ( 3a ) 

where 

<?(d|h', R, N,T) = ( likelihood function for d given h' ) (3b) 

q(h!\T) = (a priori expectations regarding the incident wave.) (3c) 

, 1|T1 / normalization constant equal to the probability \ ,„ lN 

v(d\R,Af,l) = , \ r ) 3d 

\ of observing d given response R and noise TV / 

The probability v is independent of h. We can ignore it when our only goal is to find the h 
that maximizes /; however, as we will see below, v plays a vitally important role in helping 
us choose our prior q. In the following subsections we describe how to compute / by finding, 
in turn, of the probabilities g, q and v. 



2.2. The Likelihood Function g 



Focus attention first on the likelihood function g(d|h', R, A/", X). If the signal content 
of d corresponds to incident gravitational radiation h, then d — Rh is just detector noise. 
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Correspondingly, the likelihood function g(d|h, R, A/", X) is just the probability that d 
is noise. How do we evaluate this probability? 



Rh 



Gravitational wave detectors, interferometric or acoustic, measure a continuous quan- 
tity: i.e., they are not particle detectors that count discrete events (e.g., photons). The 
statistical proper ties of the noise in these detectors is characterized by its power- and cross- 
spectral density JAbbott et alil2004al : Eazzarinil 12002 . 1200.4 12004 boOfih or, equivalently, its 
mean and covariance kittellll958h ~R The Principle of Maximum Entropy was introduced by 
Javnesl jl957bB l as a means of identifying probability distributions whose "information con- 
tent" is, in a well-defined and relevant way, consistent with this kind of a priori information 
but otherwise incorporates no other assumptions. It was first articulated in the context of 
statistical thermodynamics, where it provided a foundation for understanding the role of the 
Gibbs Ensemble and entropy maximization in statistical mechanics; however, it is ultimately 
a logical statement about statistical inference and, as such, statistical thermodynamics is just 
one application of the Principle of Maximum Entropy. 

The articulation of the Principle of Maximum Entropy was made possible by the devel- 
opment of a theory of information and a (recovered) understanding that probability is a more 
general concept than "relative frequency of occurrence" . That a probability distribution em- 
bodies information is a relatively straightforward proposition to demonstrate. Consider a 
physical system that can be in any one of several states and suppose that we can measure 
certain properties v, . . . of the system whose values depend on — but do not necessarily 
determine — the system's state. For example, the system might be an N- dimensional quan- 
tum simple harmonic oscillator, whose state is determined by the N quantum numbers n\, 
n 2 , . . ., tin, and the observable might be the total energy of the system, // = h(N/2 + J2k n k)- 
If we have no knowledge of the system's state then it is natural that we should regard it as 
equally likely that the system is in any particular state: i.e., if we were to assign a number p, 
< p < 1, to represent our degree of certainty that the system's state is n, with represent- 
ing complete certainty that the system is not in state n and 1 representing complete certainty 
it is in state n, then p would be the same for all n. Now suppose that we measure or are told 
fi is equal to /i : what does this new information tell us about the state of the system? In 
our example, we now know that only those states n satisfying J2k n k equal to p, /h — N/2 are 
possible but that, among these, no n is preferred over another: i.e., the number p we should 
ascribe to state n vanishes if /i(n) is not equal to [1$ and is equal to a constant for all other 
states. Given this distribution we can express our expectations regarding other properties 
of the system: e.g., our expectation that the property v takes on the value h> is equal to the 



3 In fact, the noise cross-spectral density — equivalent to the correlation of the noise between two different 
detectors — has not yet been evaluated for or played a role in the analysis of LIGO data. 
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ratio of the number of states for which (z/, jj) is equal to (uq, fio) to the number of states for 
which \i is equal to p, . These numbers p both represent our degree of certainty regarding 
the state of the system and satisfy exa ctly the same algebraic and relational laws that we 
require of probabilities (lCoxlll946l . If96ll ); so, we will hereafter refer to them as probabilities. 



In this demonstration we see that probability is more general than "relative frequency 
of occurrence" , that it does not require that we invoke an ensemble or make any assumptions 
regarding ergodicity — there is just one system and it is in a definite state — and that it 
represents in a meaningful way our state of kn owledge about th e system. Modern information 
theory was launched in a series of papers by IShannon Jl948cM in which he gave a definite 
meaning to "information content" in the context of a probability distribution, described the 
properties that a measure of information should have, and showed that there exists a unique 
measure of the information content of a probability distribution, which is the negative of 
what is conventionally understood as the entropy of an ensemble. It is more common to refer 
to the negative of the information content as the measure of uncertainty, or (information) 
entropy: 

H (p) = -^Vk^ogp k (4) 

k 

where the sum is over all states of the systemQ For a continuous distribution p(x) the 
summation becomes an integral and we must normalize the probability as it occurs in the 
logarithm by the density of states p(x): 



H{p) 



d n xp(x) loj 



p(x) 
' p(x) 



(5) 



Returning to our previous example, it is clear that there may be a large number of 
distributions q(n) that are consistent with the knowledge that the observed fi is equal to 
Pq. Each of these distributions has an entropy H(q); of these, one — call it q — will have 
the greatest entropy. We can find that distribution by finding the q such that the variation 
SH(q)/6q, subject to the constraint that ^2 k qk = 1, vanishes: 



Qk 



H(q) + \ J2qk 



^ [-1 - logg fc + A ] 5q k 

k 

exp (A - 1) 



(6a) 
(6b) 



The base of the logarithm corresponds simply to a choice of units in which to measure entropy. 
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where the Lagrange multiplier Ao is chosen so that YlkQk = 1- The maximum entropy 
distribution q assigns equal probability to every state n such that p(n) is equal to pq\ i.e., q 
is the distribution that embodies what we understand to be just the information that p{n) 
is equal to po, but no more. The Principle of Maximum Entropy states that this is always 
so: i.e., given a system with possible states x and constraints on the system in the form of 
functions of the state, the probability distribution p(x) that embodies just the information 
in those constraints is equal to the distribution that maximizes the entropy (either equation 
lUorEJ) subject to the constraints. 

We can use the Maximum Entropy Principle to find the likelihood function g — i.e., the 
probability distribution that d — Rh is equal to detector noise — given the characterization 
of the gravitational wave detector noise described by its mean p and autocorrelation function 

Cf. 



N-l 



Ct 



(x) ~ lim N- 1 V 



k=0 



{XkXfo 



+i) ~ lim N 1 (x k - n) {x k+ i - p) 



(7a) 
(7b) 



k=0 



We can construct many example probability densities p(x) whose moments take on these 
values. From among these we desire the particular probability distribution p(x) that has 
maximum entropy: i.e., we desire the distribution that has minimum information subject to 
the constraints that its moments satisfy equations^ This distribution satisfies the variational 
equation 







^p(x) hgp(x) - A I 1 - ^p(x) J - Ai I p -^2x k p(x) ) 

X \ X / \ X / 



21 Q 



^2(x k - n){x M - n)p(x) 



where Ao, Ai and X2/ are Lagrange coefficients, which are chosen to insure that the probability 
is normalized (Ao), that the mean < x k > is p (Ai), and that the auto-correlation < x k x k +e > 
takes on the value q (A2/). Solving equation [H] is straightforward: taking the variation we 
find 



° = E 



(l + log f(x)) + A + AiXfc + ^2 ^/ ^( X k ~ n)(xk+£ - fj) 



i.e., log / is quadratic in x, subject to the constraints of unitarity and equations or 



27r) dim "det||C|| 



-1/2 



cxp 



-^{x-p) T C x {x-p) 



(9) 



(10) 
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where jl is the dimx vector all of whose elements are /i and the covariance matrix C 1 is the 
related to the autocorrelation function by 

Cjk = Ck-j. (11) 

When nothing more than the noise distributions mean and covariance are known the least 
presumptive model for the noise statistics is a multivariate Gaussian. Adopting any other 
noise model makes additional (and, since we know only /x and eg, unjustified) assumptions. 
This important result is contrary to popular prejudice, which views Gaussian noise models 
suspiciously even when nothing more is available for use in analysis than the noise mean and 
covariance. Correspondingly, for the analysis of gravitational wave detector data we have 

log s(d|h', R, A", X) = -\x\^ h', R, N) + const. (12) 

where 

X 2 (d, h', R, N) = (d - Rh') T N _1 (d - Rh') (13) 

may be evaluated directly: i.e., without the need to invert the generally ill-conditioned 
response R. 



2.3. The a prior probability q 

When the observables d over-determine the incident wave h it is tempting to ignore 
q(h'\T) in equation [3j minimize x 2 (i- e -> maximize the likelihood) over h', and declare that 
h' is the inferr e d inc ident wave. This is, with minor variation, the approach taken by 



Giirsel fc Tintol (119891 ). While an entirely legitimate approach to deconvolution, maximum 
likelihood methods generally over-fit d: i.e., they find a h' that leaves a residual d — Rh' 
that is inconsistent with the known noise properties Af. One role played by q(h'\T) is to 
resist this tendency toward over-fitting noisy data. 

One commonly thinks of the a priori probability q(h'\l) as a representation of our 
general expectation signals h prior to the particular observation d. Criticisms of Bayesian 
analyses generally focus on the necessity of invoking a priori probabilities like q. A great deal 
of intellectual energy has been devoted to developing ways of identifying a priori probabilities 
that are, in some meaningful sense, without prejudice. The Maximum Entropy Principle, as 
described above, provides one means of doing so, which is especially useful when the set of 
sy stem states is discrete or the density of states is known. Another approach, also pioneered 



by I Jayned (119681 ). takes as its starting point the idea that the prior q should be equivalent for 



equivalent experiments: more formally, that q should be form-invariant under the same set of 
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transformations that leave the experiment invariant. Such priors encode only our knowledge 
of the experiment's nature and do not presuppose any outcome. We can use that principle 
in the present instance to find the class of priors q that are the least presumptive regarding 
the signal h. 

Let us first focus on some general considerations that should govern the prior q. Writing 
h as the sum of two time series, corresponding to the two polarizations h + and h x of an 
incident gravitational wave, we note that 

• Lacking any reason to presume that the source is oriented in a particular way relative 
to the detector line-of-sight, the prior should be invariant under an arbitrary rotation 
of + into x polarization; 

• Lacking any reason to presume that the gravitational wave burst arrives at a particular 
moment in time, the prior for h{t) and h(t + r) should be the sam^f]; and, finally, 

• Lacking any specific source model that dictates or suggests how energy is distributed 
throughout the burst, the prior should not favor waveforms with any particular degree 
of smoothness (or autocorrelation or spectrum). 

We could, of course, introduce specific knowledge or assumptions into the construction of 
the prior, if we have any: for example, we may believe that the gravitational wave power 
should be concentrated in a band of frequencies, or that there is a relationship between the 
radiation in the + and x polarization states, etc. In the present instance, however, our 
goal is to specify a prior that makes the fewest possible assumptions (beyond propagation 
direction, which we take to be known) about the nature of the gravitational wave signal 
incident on our detector (s). 

With these general considerations in mind we may write h + (h x ) in a Fourier expansion: 

h k = Aj cos(u jt k ) + Bj sm(u jt k ). (14) 

j 

The prior on h can just as well be expressed as a prior Q on the A and B. From our general 
considerations and the property of Fourier series we conclude that 

Q(A,B\1) = Y[Q(A k \l)Q(B k \T); (15) 

k 



5 In the case of galactic core-collapse supernova we expect that the arrival time of the observed neutrinos 
will coincide with the arrival time of the gravitational wave burst up to uncertainties in the time they spend 
trapped in the dense core. Within these uncertainties we treat the arrival time of the gravitational wave 
burst as unknown 
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i.e., the overall prior is the product of priors of identical form evaluated for each of the 
coefficients A k and B k . 



Bretthorstl (119881 ) provides a particularly lucid derivation, which we reproduce here, of 



the prior Q that arises from the transformation group properties that we expect it should 
satisfy. To begin, we might just as well have written the Fourier expansion of h in terms of 
the Fourier amplitudes cij and phases 9j, as in 

h k = dj cos(ujjt k + 9j), (16) 

3 

with the (cij, 9j) related to the (Aj, Bj) by 

a) = A) + B) (17a) 
tan^- = Bj/Aj. (17b) 

The same considerations that led us to factor Q(A, B\T) into the product of identical func- 
tions Q of the Ak and Bk lead us to factor the prior P(a, 6\T) into the product of priors P 
of identical form, evaluated for each of the coefficient pairs {a k , 9k)'- 

P(a,9\l) = ]JP(a k ,9 k \l) (18) 

k 

Under the assumption that we do not know the signal arrival time all phases 9 are equally 
likely: i.e., 

P(M|Z) = ^©. (19) 

The choice of Fourier coordinates (A k ,B k ) or (a k ,9 k ) does not affect our prior knowledge; 
so, the prior distributions P and Q must be equivalent: i.e., 

Q{A\l)Q{B\l)dAdB = ^^dad9. (20) 

2n 

In the special case where B is equal to zero this relation becomes 

p(a\l)dad9 = 2tv Q(a\l)Q(0\l) dAdB. (21) 

Using this expression for p in equation [20] we find that the desired prior Q must satisfy the 
functional equation 

Q{x\X)Q{y\X) = Q Q(0|X), (22) 

which has the general solution 

Q(Ao\t) = BXPl ^ 2] (23) 



13 



for some unknown parameter a 2 . 

We are thus led to a one parameter family of "uninformative" priors q a (h\T): 
q a (h\l) = Hq a (A k \2)q a (B k \2) 



exp [-^J2to(A 2 k + B : 



cxp 



1 spiV-1 , 2 " 
2^ Z^fc=0 '** 



(2tTCX 



2^/2 



(24a) 
(24b) 
(24c) 



Setting aside for the moment the question of how we might choose a, the choice of prior 
q taking the form q a (h\X) leads us to the posterior probability density 

f(ti\d,R,J\r,a,I) oc g{d\h!,H,N,l)q{h!\a,T) 



oc exp 



i X 2 (h , ) R,N ! d) + ^(h') 



where 



S(h0 = -$>'£. 



We may thus choose as the best estimate of h the h' that minimizes In/, or 

F(h'|d, R, A/\ a 2 ) = i X 2 (h\ R, N, d) - ^S(h). 



(25) 
(26) 

(27) 
(28) 



Stripped of its Bayesian statistical motivation, we recognize that maximum entropy 
deconvolution of the gravitational signal h is closely related to the solution of the inverse 
pr oblem via repola rization, with the specific choice Ylk ^1 f° r the regularization function 
S (iNeumaierl Il998l ) . Nevertheless, our Bayesian approach to inference is much more than 
a motivation for a particular choice of regularization function, or even for deconvolution 
via regularization. Whereas deconvolution provides a single estimate of the signal h, our 
Bayesian approach provides a distribution /, whose mode is the point estimate of the equiv- 
alent deconvolution problem but that also assigns a probability to any proposed h'. The 
availability of this distribution permits, among other things, a detailed error analysis for any 
problem where the details of the waveform play an important role. 



2.4. Eliminating the regularization constant 



If we view the determination of h from d as a problem in deconvolution via regulariza- 
tion, then we require some prescription for the choice of regularization constant a 2 . Viewed 
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as a problem of Bayesian inference, however, a 2 is not unlike any other unknown and we'd 
prefer, instead, to find some probability distribution u(a 2 ) that describes the probability that 
a 2 is the appropriate choice and integrate the q a over u to find a distribution q independent 
of a 2 : i.e., 

q(h\l) = J da 2 u(a 2 )q(h\a 2 ,l) (29) 

Suppose we let the data guide us to a choice ofu(a 2 ): i.e., we use Bayes' Theorem to construct 
the probability distribution u of a 2 , based on the observations d: 

M(a 2 |d,AT,R,J) oc J Dhg(d\h',R,M, l)q(h'\a 2 ,l)t(a 2 \l) (30) 

where we have introduced the prior t(a 2 \X). 

In general we have no knowledge that suggests a preferred value of a 2 ; correspondingly, 
any reasonable prior t should be approximately constant over a wide range of a 2 . Addi- 
tionally, when the data d are informative then gq a will be a sharply peaked (relative to t) 
function of a 2 . For both these reasons we may treat t as constant over the support of gq a as 
a function of a 2 in the integrand of equation [29J, in which case 

u(a 2 |d,Af,R,J) oc / Dh.g(d\h','R,J\f,T)q a (h'\T) (31) 

This integral we recognize as the normalization constant v that appears in our expression 
for / (cf. eq. [3]) when evaluated for a particular value of o. In addition to allowing us to 
infer the signal h our observations d also provide us with a probability distribution for the 
"regularization constant" a 2 . 

For our problem the distribution u(a 2 \d, A/", R,X) involves only Gaussian integrals and 
is relatively straightforward to calculate: recalling our definition of \ 2 (cf- ec L- fT3"j) and intro- 
ducing 



J Ddexp 
Dhexp 

Dhexp 



2 X 

2a 2 
1 2 



(2vr) iVd det||N| 
(2na 2 ) Nh/2 



1/2 



2a s 



(2tt 



det a- 2 I + R T N- x R 



1/2 



(32a) 
(32b) 

(32c) 



we find 



21ogw 



-X 2 (d,Rh ,AT)- ( T- 2 S(h )+21og 



ZgZ q 



-Xo 



-^-N d log 2vr + 2N h log a - log det | |cr~ 2 I + R T I\P 1 R| | , (32d) 
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where ho is the h that maximizes g(h)q a (h), Nj, is dimd, and Nh is dimh 

We can generally approximate equation [2H] for q(h\X) by q^(h\I), where a 2 maximizes 
u(a 2 \I). While conceptually we do not make a choice of a, as a matter of practice if a signal 
is recoverable / depends at most weakly on q and the desired prior q is also generally close 
to q„. To find b we can extremize our expression for logw (cf. |29{) over a: 







dlogu 
da 



a 



a~ 3 tr 



(a- 2 l + R^N^R) 



-i 



(33a) 



or 



a 



N h -tr [(I + ^R^N-iR) -1 ]' 



(33b) 



where we made use of the relation 



lndet 1 1 A|| = trln||A||. (33c) 

Equation I33bl provides an implicit relationship for a 2 , which can be solved iteratively with 
the minimization of equation [28] for h: e.g., we can solve equation [28] for h given a guess for 
a 2 and use this (<r 2 ,h) pair on the right-hand side of equation I33bl to find a new estimate 
for b 2 , with the process repeated until it converges. 



2.5. Example 

As an example, consider a plane gravitational wave signal propagating from a hypothet- 
ical source directly overhead of the LIGO Hanford Observatory (LHO) site, and incident on 
detectors at the LIGO Hanford and LIGO Livingston observatory sites. For this example 
assume that 



• Each site has a single, identical interferometric gravitational wave detector oriented in 
the same way as the actual detectors at each site; 

• The response function for each detector is frequency independent in the band of inter- 
est; 

• The detector noise is independent between detectors and white (i.e., the autocorrelation 
is a delta function in lag), with unit variance in each detector; 

• The sample rate at each detector is 4096 Hz. 
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For the signal use 
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with gravitational wave polarization tensor e x orthogonal to tangents to lines of latitude, e + 
orthogonal to e x , and 4>o is chosen so signal projection on the LHO detector is maximized. 

Figure [1] shows snippets of the simulated data from five different data sets d. Each data 
set corresponds to the (noisy) observation at the detector network described above when a 
signal of amplitude h equal to 1, 2, 5, 10, or 20, is incident on the detector network. Each 
snippet is centered on the location of the actual signal, which was 100 samples in duration 
and embedded in a data set five times as long. Figure [2] shows the most probable waveforms 
h' + and h' x as inferred, in the manner just described, from each data set. Finally, figure [3] 
shows the residual d — Rh' for each data set and inferred waveform h'. Table [1] shows the 
power signal-to-noise ratio p 2 of the recovered signals, calculated ag^| 

p 2 = (Rh'f N- 1 (Rh') = x 2 (0, h', R, N) (35) 

and the correlation between the actual and the inferred signal, calculated as 

C(j) = ^ffi* (36a) 

where 

|x| 2 = J>*. (36b) 

k 

The value of p 2 and C(j) recorded here should be taken as representative: their actual 
values depend on the noise instantiation, which varies from simulation to simulation (and 
from observation to observation). 



6 It is important to note that the signal-to-noise as defined here involves only the inferred signal: i.e., it 
is the observed signal-to-noise, which varies from simulation to simulation with the different instantiations 
of the noise. Calculated in this way it is a different quantity than the signal-to-noise often quoted in the 
context of experimental results published by the L1GO Scientific Collaboration, who report their sensitivity 
in terms of the expectation value of the sig nal-to-noise for a know n signal — not an inferred one — supposed 
to be present in the data (see, for example Abbott et al. ( 2004b[ )). 
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Comparing figures [T] and [2] we see the degree to which out inference procedure is able 
to separate signal from noise. For the cases h equal to 1 or 2 no signal is resolved: i.e., 
ho is uniformly small and, in fact, much smaller than the rms noise. For the three cases ho 
equal to 5, 10 and 20 the inference procedure clearly does resolve distinct signals in each of 
the two gravitational wave polarization. The maximum cross-correlations (cf . equation 136]) , 
tabulated in table [lj makes the correctness of the inference as a function of ho quantitative: 
the correlation between the real and inferred signals is greater than a half for signal ampli- 
tudes greater than 5, and inspection of figure [2] shows that the error is principally in the 
signal amplitude and not its time dependence. 

Should we expect to be able to do better? Estimates of the required power signal- 
to-noise for reliable detection of a gravitational wave burst in LIGO based solely on the 
coincidence of ex cess power in the detector data streams^ range from 30 to 70 f Thorne 1987 : 



Finn! Il99ll . Il992l ). Referring to table [T] it is apparent that a reliable signal emerges very 
rapidly between signal amplitudes ho of 2 and 5, corresponding to p 2 in this range. Figure [3] 
shows Rh' — d, the residual after the inferred signal is subtracted from the observations. If 
we have done a good job of separating the signal and noise contributions to the observations 
d than we expect that Rh' — d should be noise: in this example, unity rms white noise. In 
the cases h equal to 1 or 2 inspection of the residuals shows no obvious evidence of structure 
left behind by the inference procedure. For the case ho equal to five and greater, where a 
clear signal is identified, the residual does show some evidence of structure near the location 
of the signal peak. The amplitude of this apparent structure is very much consistent with 
our assumptions about the detector noise (i.e., unity rms): it is only the time dependence of 
the structure that draws the attention of the eye. Over longer random time series, even this 
would not appear to be particularly unusual: i.e., were we not aware that we were looking 
at a residual we would not have reason based on statistics to be particularly suspicious of 
the presence of a signal here. Reviewing the construction of our inference procedure, the 
quantitative agreement between our estimates of the required signal-to-noise for a reliable 
detection, and the qualitative evidence of these residuals, it is clear that absent additional 
information about the nature of the noise (which would modify our function x 2 ) or the nature 
of the signal (which would modify our choice of function S) we cannot reasonably suppose 
that there is any further information in the residuals about a possible signal. 



7 And calculated in a manner comparable to the p 2 recorded in table 1 



-18- 



2.6. Summary 

The problem of identifying a signal in noisy data can be approached as a problem of 
statistical inferrence: i.e., we can find a probability distribution that describes the credibility 
that we should ascribe to the hypothesis that an arbitrary gravitational wave burst h was 
incident on an array of detectors given observations d. This probability distribution involves 
two components: the probability g that d — Rh is detector noise, and the a priori probability 
q that a signal takes the form h. 

The probability g, also known as the likelihood function, requires a characterization of 
the noise. In the absence of a complete characterization of the noise — something that is 
rarely, if ever, possible — the Maximum Entropy Principle provides a means of selecting the 
probability g that is simultaneously consistent with what is known about the noise and least 
presumptive about what is not. In particular, when the noise is characterized by its mean 
and covariance — the usual case — then g takes the form of a Normal distribution. 

The a priori probability density q characterizes our prior expectations regarding signals 
h. In the spirit of "letting the data speak for itself" we avoid prejudicing our analysis with 
preconceived notions regarding the nature of potential sources by focusing instead on our 
ignorance of signal polarization, arrival time, and energy spectrum. In this way we are led 
to a one-parameter family of potential priors q a . Rather than choose a particular prior from 
this family, we find that the data allow us to provide probabalistic weights u(a) to different 
values of the parameter a. Rather than make a particular choice of distribution q a , we 
evaluate the expectation value of q a over the distribution of the parameter a and use it for q, 
eliminating our dependence on the unknown parameter a. As a practical matter, it is often 
sufficient to find the a that maximizes u(a) and use q^ for q. 

3. APPLICATION TO LIGO OBSERVATIONS OF SN 

How well can we hope to characterize the astrophysics of core-collapse supernovae from 
the gravitational wave signature we observe? The detailed astrophysics of core-collapse 
supernovae is uncertain, difficult to model, and involves large-scale convection and other 
stochastic processes. For all these reasons, it is more likely that we will infer the features of 
the gravitational wave burst associated with a supernova and then use that inferred signal 
to validate our models, than that we will "discover" the gravitational wave signal from a 
core-collapse supernova by using matched filtering to extract it from deep within a noisy 
data stream. 

As a first step toward exploring how, and how well, we may be able to diagnose the con- 
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ditions of the collapsing core from the gravitational waves it radiates we use the techniques, 
described in the previous two sections, to infer a simulated signal embedded in LIGO-like 
noise and evaluate the cross-correlat ion betwe e n the inferred signal and a wide range of 
signals calculated from the models of lOtt et al.1 (120041 ) . We do this for 



• LIGO data drawn from the S1-S4 science runs, evaluating the maximum distance at 
which there is a significant correlation between the inferred and simulated signals (cf. 
sec. ETOl : 

• Simulated signals with varying maximum central density (cf. sec. 13.2.21) . evaluating 
our ability to evaluate the properties of the core-collapse; 

• Simulated signals with varying progenitor mass, evaluating our ability to test the cor- 
relation between progenitor models and the configuration of the collapsing stellar core 
(cf. sec. 13X31) ; 

• Simulated signals with varying angular momentum and differential angular velocity, 
allowing us to explore the role that differential rotation plays in core collapse (cf. sec. 



We find that, even when the simulated signal is so weak that the inferred signal appears 
very different, the inferred signal still has its greatest cross-correlation with the simulated 
signal. We conclude that the model whose signal has the maximum cross-correlation with 
the inferred signal is likely to provide a good indication of the physical properties of the 
source. 



3.1. The Gravitational Wave Signature of Core-Collapse Supernovae: 

Description of Core-Collapse Models 



tude of processes, including rotating core collapse and core 


Dounce (e.g., 


Frver et a 


2002: 


Dimmelmeier et al. 


2002; 


Ott et al. 


2004; 


Dimmelmeier et al. 


2007a 


b 


Ott et al. 


2007) 


, post- 



boun c e connective over turn, anisotropic neutrino emission ( Burrows &: Hayesl 1996TMiiller et al 



2004; Ott et a 



(IRampp et al 



1998 



2007) , nonaxisymmetric rotational instabilitie s of the protoneutron star 



Ott et al.l 120071 : IShibata fc Sekiguchi 



posed protoneutro n star core g-mode oscillations (IBurrows et al. 



200F 



5 L or from the recently pro 



2006, 2007; Ott et al. 2006 



Ferrari et al.ll2003l ). I n addition and in the con text of the core-collapse supernova — gamma- 
ray burst connection (jWoosley fc Bloomll2006l ). late-time black hole formation in a failed or 
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weak core-collapse supernova may accompany gravitational wave emission from quasi-normal 
ring-down modes of the newly-formed black hole. 

Of all the above emission processes, rotating iron core collapse and bounce is the most 
extensively modeled and best quantitatively and qualitatively understood. For these reasons 
we limit our present study to an analysis of the rotating core collapse and bounce signature 
only and use example templates compu ted from a set of 2D axisymmetric Newtonian core 
collapse simulations by lOtt et al.l (120041 ) which focused on the dynamics of rotational collapse 
and bounce. These represent a class of signatures of core collapse and are good templates 
with which to exercise the signal extraction technology we have developed. This paper is the 
first to put theoretical models of the gravitational wave signals of core collapse and bounce 
through a realistic detector pipeline and to attempt to extract physical information using 
sophisticated signal processing algorithms. 

The astrophysics models involved stell a r pro genitors with various masses: 11, 15, 20, 
and 25M calculated in lWoosley fc Weaver! (119951 ). The simulations neglected the effects of 
neutrinos, general relativit y, and magnetic fie l ds, bu t used the realistic, finite-temperature 
nuclear equation of state of lLattimer fc Swestvl (119911). A sma ll nu mber of simu l ations also in- 
vestigated stellar progenitor models from lHeger et al.l (120001 ) and lHeger et al.l (120041 ). which 
were evolved to the onset of iron core c ollapse with an approximate treatment of rota- 



tion dHeger et al 



netic torques (IHeger et al 



2000 



H 



eger 



et al.l 120041 ) and angular momentum redistribution by mag- 



20041 ) . The gravitational wave signature extra ction was performed 



using the Newtonian quadrupole formalism (see e.g. iMisner et al.lll973l ). 



The effects of rotation were investigated in lOtt et al.l (120041 ) . The initial rotation of the 
progenitor was controlled by two parameters: the rotation parameter (3 where 



E r , 



E. 



(37) 



grav 



and the differential rotation scale parameter A, which is the distance from the rotational 
axis at which the rotational velocity drops to half that at the center. A is defined as 



, r 

1 1 1 



(38) 



where r is the distance from the axis of rotation and Q(r) is the angular frequency at r. 

When the progenitor is rotating slowly and (3 is small (zero to a few tenths of a percent), 
the collapse is halted when the inner core reaches supranuclear densities. The core bounces 
rapidly and then quickly rings down. When the progenitor rotates more rapidly and (3 is 
larger, the core collapse is halted by centrifugal forces and the core bounces at subnuclear 
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densities. The core then undergoes multiple damped, harmonic oscillator-like expansion- 
collapse-bounce cycled The initial degree of differential rotation affects the value of (3 at 
which this bounce type transition occurs. A progenitor with a smaller value of A experiences 
a greater amount of differential rotation and hence, a more rapidly rotating inner core. As 
a result, the transition from a supranuclear to a subnuclear bounce occurs for a lower value 
of f3. 



The models of lOtt et al.l (120041 ) yield absolute values of the dimensionless maximum 
gravitational wave strain h max m the interval 2xl0~ 23 < h max < 1.25 xl0~ 20 at a detector 
distance of 10 kpc. The total energy radiated (Eqw) lies in the range 1.4x 10~ n M Q c 2 

< -Eqw — 2.21 x 10~ 8 M c 2 and most of it is emitted in the primary gravitational wave 
burst associated with core bounce. The energy spectra peak in the frequency interval 20 Hz 

< /peak < 600 Hz with rapid and differential rotators having peaks at low frequencies and 
moderate and rigid rotators peaking at high frequencies. 



3.2. Gravitational wave observations of rotating stellar core collapse 



In all the simulations described here we used lOtt et al.l (120041 ) model sl5 Al000B0.1 
as the "real" signal, and correlated the inferred signal with oth er models in the lOtt et al. 
d2004h catalog. Model sl5Al000B0.1 corresponds to a 15 M IvVooslev fc Weaver! Jl995h 
progenitor with rotation parameter {3 equal to 0.1% and differential rotation scale parameter 
A equal to 1000 km. We scaled this signal to represent core collapse events at different 
distances and projected the incident signal onto the LIGO 4-km Hanford WA (LHO) and 
Livingston, LA (LLO) detectors. For the purpose of this st udy we as s umed that the core 
collapse was directly overhead of the LHO site0 Since the Ott et al. ( 2004 ) core collapse 
models are axisymmetric the gravitational waves they emit are linearly polarized. We chose 
the polarization angle to maximize the response of the LHO detector and used the actual 
response functions for the LHO and LLO detectors characteristic of LIGO's S4 science run 
( [Gonzalez et al.ll2004l ). We simulated the detector noise by adding white noise with power 
spectral density amplitude approximately equal to the noise amplitude at 100 Hz in the 



8 Recent results of lOtt et al.l (|2007l ) and iDimmelmeier et al 



i) suggest that such multiple-bounce 
dynamics are less likely when general relativity and deleptonization are taken into account. 

9 In this work we have always assumed that the gravitational wave signal we observe is accompanied by 
a neutrino or electromagnetic signal that determines the source sky position. 
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corresponding science run ( Lazzarinill2005l ) J 10 l Finally, we used the maximum entropy method 
described above to find the best point estimate of the e mbedded signal and cross-cor r elated 
this estimate signal with different signals drawn from the lOtt et al.l (120041 ); lOtt et al.l (120071 ) 
parameter survey The subsections below describe our observations based on this study. 



3.2.1. Science Run and Survey Range 



Before discussing how well we can distinguish between different core collapse models we 
looked at how close a core collapse event would have needed to be for LIGO, during its first 
four science runs (SI, S2, S3 and S4), to infer its waveform with reasonable accuracy. For this 
purpose we used the detector resp onse functions and noise power spectral density amplitudes 



for the correspon ding science run (lAdhikari et al.ll2003l ; iGonzalez et al.ll2004l ; lLazzarinill2002 
20031 . 120041 . [2005h . 



Figure H] shows the maximum cross-correlation, for each of the first four science runs, 
between the inferred and actual waveforms as a function of the core collapse distance. There 
is a steady improvement, from SI to S4, in maximum entropy's ability to recover signals 
at greater distances, corresponding to improving detector sensitivity: by S4, we are able 
to infer the gravitational waveform from core collapse events that occur as far as a few 
kpc away. The up-tick in cross-correlation at the 1 kpc mark in curve for S3 sensitivity is 
the result of a discontinuous change in the most probable waveform (i.e., a new extrema 
becoming the global maximum in the probability function.) The investigations described in 
the following sections, which delve into the source information present in the inferred signal, 
use the response and noise characteristic of the LIGO S4 science run. 

An alternative way of expressing the results summarized in Figure H] is to describe the 
sensitivity required of a LIGO-like detector to observe an optimally located supernovae to a 
given distance. Under the "white noise" approximation made here, doubling the distance to 
which LIGO is able to infer the waveform from a core collapse supernova requires doubling 
its sensitivity: i.e., halving the rms noise amplitude at 100 Hz. Referenced to LIGO's design 
sensitivity at 100 Hz, which was achieved during LIGO's S5 science run, and a 70% maximum 



10 At the frequencies where the signal power (in units of squared strain) of the simulated core collapse event 
gravitational wave signals peaks (~500 Hz) the noise power spectral density in any of the current generation 
of interferometric gravitational wave detectors is increasing relatively slowly with frequency. (Very little is 
known concerning the cross-correlation in the noise of two separated detectors, other than that it is very 
small.) While we have chosen to make this demonstration with white noise, the maximum entropy analysis 
method introduced above naturally accommodates any noise covariance: cf. the paragraph including equation 

HTJ. 
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cross-correlation, the maximum distance at which core collapse supernova are expected to 
be observable is 

/q x in- 23 H7" 1/2 \ 

= 7S3 mm ) m 

where Sh{f) is the strain-equivalent noise power spectral density. The intermediate LIGO 
upgrade underway at this writing is expected to reduce Sh by a factor of four, and the 
adva nced LIGO u pgrade that will follow will reduce Sh by another factor of approximately 
100 (ILIGOl 120071 ). Correspondingly, we expect that advanced LIGO will be capable of 
observing core-collapse supernovae like those modeled here at distances as great as 80 kpc: 
i.e., beyond the distance to the LMC and SMC, but far short of the Virgo cluster. 



3.2.2. Bounce Type 



We can classify the models of lOtt et al.l (12004 ) into those that bounce at supranuclear 



subnuclear and transitional central densities. Figure [5] shows the maximum cross-correlation 
between the inferred waveform, the actual waveform (a supranuclear bounce type), and three 
other waveforms that each have, within their respective categories (supernuclear, subnuclear 
and transitional), the greatest cross-correlation with the inferred waveform. (Figure O shows 
these four waveforms themselves, each with strain scaled to a distance of 10 kpc.) It is clear 
that the inferred waveform has the most in common with that generated from a model with 
the same, supranuclear, bounce type and that, for S4 detector sensitivities, our ability to 
distinguish bounce type fails for core collapse events more than 3 kpc distant. 

To facilitate comparison with signal-to-noise ratio sensitivities as reported by the LIGO 
Scientific Collaboration for the LIGO detectors, we show the quantity "SNR 2 " , defined by 

SNR 2 = l(H 2 1+ H-i), (40a) 

= < 40b > 

3=0 k 

where h is the actual waveform, running across the top of this and subsequent figures. The 
quantity SNR 2 can be compared to the single-detector, optimal orientation, expectation 
value of the power signal-to-noise ratio sensitivities reported by LIGO in its publications: 
i.e., assuming that the actual gravitational waveform were known, the expectation value of 
the signal-to-noise determined by matched filtering would be given by SNR. 



-24- 



3.2.3. Mass 

Figure [7] shows the cross-correlation between the inferred and actual waveforms associ- 
ated with core collapse models that differ by progenitor mass, but share the same rotational 
parameters. Figure [8] shows the waveforms used for the cross-correlations. Again, for S4 
sensitivities the inferred waveform most closely resembles the waveform from the model with 
the same mass at distances up to 2-3 kpc. 

3.2.4. Rotation 

Figure [9] shows the cross-correlation between the inferred waveform, the actual wave- 
form, and the waveforms associated with a set of models that differ only by rotation pa- 
rameter /?, while figure [TU] shows the cross- correlations for models that differ only by the 
differential rotation parameter A (cf. eq. |38{) . Figures [TT1 and [T2l show the waveforms for 
the set of models whose cross-correlations are shown in figures and [101 As expected, the 
cross-correlation decreases as the rotational parameters of the models depart from those 
associated with the actual waveform. 

4. DISCUSSION/CONCLUSIONS 

We have described and demonstrated the use of a Bayesian method, in the spirit of 
the so-called Maximum Entropy methods for analyzing astronomical image data, for infer- 
ring the time-dependent waveforms of the radiation incident on a network of gravitational 
wave detectors. In contrast to the methods developed for image analysis, which rely on the 
Principle of Maximum Entropy to select an a priori probability for use in the Bayesian anal- 
ysis, we use the MaxEnt Principle to determine the appropriate choice of likelihood function 
for describing gravitational wave detector noise and use transformation group arguments to 
identify an appropriate, uninformative a priori probability density. We have demonstrated 
that the method described is quite capable of inferring the presence of signal in detector noise 
once the detector response to the signal begins to approach the noise rms, and very quickly 
converges on the correct signal amplitude and temporal structure of the incident signal as 
the actual signal amplitude increases. We have argued, based on the methods construction, 
a quantitative assessment of the correlation between the inferred and actual signal and the 
observed and expected signal-to-amplitudes associated of simulated data sets, and on a qual- 
itative review of the residuals following subtraction of the detector networks response to the 
inferred signal from the observations themselves, that the method developed here is the best 
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that one can expect to be able to do for detector networks whose noise character is known 
only via its power spectral density and for signals that are assumed to have arbitrary time 
dependence. 

Inferring the gravitational wave signal incident on a set of detectors is but the first task of 
analysis. Our ultimate goal is to use the inferred signal to understand the radiation source. 
We demonstrate how this might be done using gra vitational wave s ignatures calculated 
from the ro tating core colla pse supernovae models of lOtt et al.l (120041 ) . Using simulations 
drawn from lOtt et al.l (120041 ) we have generated simulated LIGO observations of core collapse 
supernovae, used our Bayesian analysis method to infer the most probable gravitational wave 



signal content of the data , and corr e 



ated this inferred signal with a wide range of signals 



drawn from the catalog of lOtt et al.l (120041 ) . We have found, in the context of this example, 



that at the signal-to-noise ratios expected to be used as a threshold for detection of core- 
collapse supernova burst events in LIGO, the correlatio n is m aximized when the inferred 
signal is matched against its parent from the lOtt et al.l (120041 ) catalog: thus, the analysis 
method we describe here infers the incident gravitational wave signal with enough fidelity 
to distinguish between models of different maximum central density, angular momentum, 
differential rotation, and progenitor mass. 

To make the comparison between the inferred and model waveforms we used the max- 
imum of the cross-correlation between the two, choosing the model whose maximum cross- 
correlation with the inferred waveform is the greatest. A Bayesian treatment that assigns 
odds to different models given the inferred waveform is a natural next step; however, we 
note that a Bayesian treatment would involve the exponential of the cross-correlation func- 
tion (as opposed to the maximum cross-correlation) marginalized over the lag. When the 
cross-correlation is a sharply peaked function, this will be proportional to the exponential of 
the maximum of the cross-correlation among the different models. Thus, we conclude that 
the cross-correlation as we use it here is almost certainly likely to choose the same model 
as a more sophisticated Bayesian analysis, except in those cases where the waveforms of the 
different models are nearly indistinguishable. 

While our example applications have been in terms of two interferometers, there is 
nothing in the description of the method that specifies the number of detectors and the 
application to three or more detectors is no different than for one or two detectors. The 
example application provided here involved just the two main LIGO detectors; however, 
we could just as well have augmented the analysis by including simulated data Virgo, GEO, 
TAMA and the operating bar detectors. For the particular simulations described here, adding 
any one of these detectors would most likely not improve our estimates of the signal since 
- for the examples presented here — we chose to place the source in the most favorable sky 
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location and orientation for detection by the LIGO Hanford Observatory (LHO) site. For a 
source so positioned the strain amplitude in the LIGO Livingston detector is 89% of that for 
the LHO detectors, while for the GEO600 site detector it is 42%, 22% for TAMA300, and 
1% for Virgo. On the other hand, the addition of these other detectors to a real network 
would increase the overall sensitivity across the sky, as a source optimally oriented for Virgo 
would, by reciprocity, be very poorly oriented for the LHO detectors. 

Gravitational wave astronomy is not yet a reality. Nevertheless, with the detectors now 
operating and the enhancements currently underway, it is only a matter of time before we 
have observations of from these instruments, from which we can gain insight into astronomical 
phenomena that are otherwise hidden from our sight. Methods like those described here 
will allow us to move beyond gravitational wave detection and realize the full promise of 
gravitational wave astronomy. 
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Table 1: Signal to noise p 2 and maximum cross-correlation maXj-C(j) for a simulated gravi- 
tational wave signal incident on a two-detector LIGO-like network. The signal and detector 
network model are described in §2.51 . 



h 


P 2 


maXj C(j) 
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0.29 


0.08 
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1.5 


0.11 


5 


1.5 x 10 2 


0.54 


10 


1.0 x 10 3 


0.83 


20 


5.4 x 10 3 


0.94 
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Fig. 1. — A short snippet of the simulated data surrounding the embedded signal in the 
simulations described in §2.51 The embedded signal was approximately 100 samples long 
and centered in the snippet shown. The demonstration analysis was carried out on a data 
segment five times longer than the signal. See §2.51 for more details. 
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Fig. 2. — The inferred gravitational wave signal in the + and x polarization states for the 
five data sets described in §2.51 Each panel here corresponds to the data shown in the 
corresponding panel of figure [U Note that the two detector network does an excellent job of 
rejecting detector noise (which has unity variance) and that the actual signal emerges very 
rapidly once the signal amplitude begins to exceed the noise amplitude. See §2.51 for more 
details. 
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Fig. 3. — The residuals d — Rh' following subtraction of the response of the simulated 
detector network to the inferred signal (Rh') from the observations (d) made in the network, 
as described in §2.51 Each panel here corresponds to the data shown in the corresponding 
panel of figures [T] and [2J The amplitude of the residuals is consistent with detector noise and 
the temporal structure of the residuals is, in the absence of additional knowledge regarding 
either the actual signal or the detector noise, also consistent with the detector noise. See 
§2.51 for more details. 
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Fig. 4. — Maximum cross-correlation between estimated and initial signals versus core col- 
lapse distance for data simulated using different science run detector impulse responses and 
noise levels. Th e estimated signa l is inferred using maximum entropy from simulated de- 
tections that use lOtt et al.l (120041 ) model sl5Al000B0.1 as the initial signal waveform. The 
up-tick in the S3 run, located at approximately 1 kpc, is the result of a discontinuous 
change in the most probable waveform. There is a steady improvement in maximum en- 
tropy's ab ility to reconstruct fainter, more d istant signals as the sensitivity of the detectors 
improved jLazzarinilhool 120031 . booi boosl ). 
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Fig. 5. — Maximum cross-correlation between reconstructed waveforms and waveforms as- 
sociated with models that differ by bounce type versus core collapse distance and SNR 2 . 
The SNR 2 shown is the average for the two detectors. The SNR 2 for the 4-km Hanford 
detector is 1.05 times that shown while the SNR 2 for the 4-km Livingston detector is 0.95 
times that shown. The reconstructed waveform is i nferred us i ng m aximum entropy from 
simulated detections that use the waveform from the lOtt et al.l (120041 ) model sl5Al000B0.1 
as the initial signal waveform as well as detector responses and noise levels from the fourth 
science run (S4). The solid line represents the maximum cross-correlation between the re- 
constructed signal and the initial signal waveform. The other lines represent the maximum 
cross-correlations between the inferred waveforms and the waveforms resulting from each 
bounce type for which the maximum cross- correlation at 1 pc is greatest, excluding that 
used for the initial signal. The inferred waveform is most similar to those generated by 
models with the same, supranuclear bounce type as the initial signal waveform, for the 
simulations corresponding to core collapse events that occur less than 2-3 kpc away. 



-38- 



x 10 



J 


^|W\av\a /v -' — " — ■ 




Initial Signal 
S15A1000B0.1 



20 40 60 80 



x 10 



J 






1|VW\Ma— 

Supranuclear 
S20A50000B0.2 



20 40 60 80 



5 

-5 
-10 



x 10 



-21 









^Va^n ■ 

Transitional 
S11A1000B0.2 



20 40 60 80 
Time (ms) 



5 

-5 
-10 



x 10 



-21 









Subnuclear 
S11A500B0.2 



20 40 60 80 
Time (ms) 



Fig. 6. — The waveforms associated with various bounce types that are compared with the 
inferred signal in Fig. [5l The upper left plot shows the waveform (from Ott et al. [2004] 
model sl5A1000B0.1) that was used as the initial signal in the simulated detection. The 
three other waveforms shown are those that a re most similar to this initial signal waveform, 
for each bounce type. The waveform from the lOtt et al.l (120041 ) s20A50000B0.2 model looks 
much like the initial signal waveform which is of the same, supranuclear bounce type. The 
subnuclear bounce waveform shows the effects of multiple damped, harmonic oscillator-like 
expansion-collapse-bounce cycles. The zero points of the time axes are chosen so that the 
minima of the waveforms occur at the same time for ease of comparison. The waveform 
amplitudes are scaled to correspond to core collapse events at 10 kpc. 
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Fig. 7. — Maximum cross-correlation between reconstructed waveforms and waveforms as- 
sociated with models that differ only by progenitor mass versus core collapse distance and 
SNR 2 . The SNR 2 shown is the average for the two detectors. The SNR 2 for the 4-km 
Hanford detector is 1.05 times that shown while the SNR 2 for the 4-km Livingston detector 
is 0.95 times that shown. The reconstructed waveform is inferred using maximum entropy 
from simulated detections that use a waveform from a model with a progenitor mass of 15 
solar masses (Ott et al. [2004] model sl5A1000B0.1) as the initial signal waveform as well 
as detector responses and noise levels from the fourth science run (S4). The inferred wave- 
form is most similar to that generated by the model with the same progenitor mass for the 
simulations corresponding to core collapse events that occur less than 2-3 kpc away. 
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Fig. 8. — Wave forms from mod els that differ only by progenitor mass. The waveform 
corresponding to lOtt et al.l (120041 ) model sl5Al000B0.1 was used as the initial signal in the 
detection simulations. The zero points of the time axes are chosen so that the minima of 
the waveforms occur at the same time for ease of comparison. The waveform amplitudes are 
scaled to correspond to core collapse events at 10 kpc. 
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Fig. 9. — Maximum cross-correlation between reconstructed waveforms and waveforms asso- 
ciated with models that differ only by rotation parameter (3, which is defined in equation (137|) . 
versus core collapse distance and SNR 2 . The SNR 2 shown is the average for the two detec- 
tors. The SNR 2 for the 4- km Hanford detector is 1.05 times that shown while the SNR 2 
for the 4-km Livingston detector is 0.95 times that shown. The reconstructed waveform 
is inferred using maximum entropy from simulated detections that use a waveform from a 
model with a rotation parameter of (3 = 0.1% (Ott et al. [2004] model sl5Al000B0.1) as the 
initial signal waveform as well as detector responses and noise levels from the fourth science 
run (S4). The inferred waveform is most similar to that generated by the model with the 
same /3 for the simulations corresponding to core collapse events that occur less than 2-3 kpc 
away. 
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Fig. 10. — Maximum cross- correlation between reconstructed waveforms and waveforms as- 
sociated with models that differ only by initial degree of differential rotation as parameterized 
by A, which is defined in equation ( 1381) . versus core collapse distance and SNR 2 . The SNR 2 
shown is the average for the two detectors. The SNR 2 for the 4- km Hanford detector is 1.05 
times that shown while the SNR 2 for the 4-km Livingston detector is 0.95 times that shown. 
The reconstructed waveform is inferred using maximum entropy from simulated detections 
that use a waveform from a model with a differential rotation parameter of A = 1,000 km 
(Ott et al. [2004] model sl5Al000B0.1) as the initial signal waveform as well as detector re- 
sponses and noise levels from the fourth science run (S4). The inferred waveform is the most 
similar to that generated by the model with the same initial degree of differential rotation. 
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Fig. 11. — Waveforms from models that differ only by rotation parameter /?, defined by 
equation (l37l) . The waveforms for larger (3 (> 0.4%) have significant amplitude over durations 
of hundreds of ms, while low (3 waveforms last only for tens of ms. The (3 = 0% waveform 
has very low amplitude as the non-ro t ating collapse is nearly spherically symmetric. The 
waveform corresponding to lOtt et al.l (120041 ) model sl5Al000B0.1 was used as the initial 
signal in the detection simulations. The zero point of the time axes for the plots on the right 
is chosen so that the onset of significant gravitational wave amplitude occurs at roughly the 
same time while the plots on the left show the first 800 ms of the waveform. The waveform 
amplitudes are scaled to correspond to core collapse events at 10 kpc. 
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Fig. 12. — Waveforms from models that differ only by initial degree of differential rotation. 
The differential rotation parameter, A, is the distance at which the rotational velocity of the 
progenitor drops to half the rotational velocity at its center. As A decreases the differential 
rotation of the progenitor becomes more extreme and the amplitu des of the gravit ational 
waves increase. The center plot shows the waveform corresponding to lOtt et al.l (12004 ) model 
sl5A1000B0.1 which was used as the initial signal in the detection simulations. The zero 
points of the time axes are chosen so that the minima of the waveforms occur at the same 
time for ease of comparison. The waveform amplitudes are scaled to correspond to core 
collapse events at 10 kpc. 



